A multi-windowing technique for thresholding an image using local image properties



(57) A technique, specifically apparatus and an accompanying method, for accurately thresholding an
image based on local image properties, specifically luminance variations, and particularly such a
technique that uses multi-windowing for providing enhanced immunity to image noise and lessened
boundary artifacts. Specifically, a localized intensity gradient, G(i,j), is determined for a
pre-defined window (300) centered about each image pixel(i,j). Localized minimum and maximum pixel
intensity measures, Lmin and Lmax, respectively, are also determined for another, though larger,
window (330) centered about pixel(i,j). Also, a localized area gradient measure, GS(i,j), is
determined as a sum of individual intensity gradients for a matrix of pixel positions (370)
centered about pixel position (i,j). Each image pixel(i,j) is then classified as being an object
pixel, i.e., black, or a background pixel, i.e., white, based upon its area gradient, GS(i,j), and
associated Lmin and Lmax measures.
Description

TECHNICAL FIELD OF THE INVENTION

The invention relates to a technique, specifically apparatus and an accompanying method, for accurately
thresholding an image based on local image properties, specifically luminance variations, and particularly
one that uses multi-windowing for providing enhanced immunity to image noise and lessened boundary artifacts.

BACKGROUND ART

With digital image processing and digital communication becoming increasingly prevalent today, increasing amounts 
of printed or other textual documents are being scanned for subsequent computerized processing of one form or another 
and/or digital transmission. This processing may involve, for example, optical character recognition, for converting printed 
characters, whether machine printed or handwritten, from scanned bit-mapped form into an appropriate character set, 
such as ASCII, the latter being more suitable for use with word processing and similar computerized
document-processing tasks.

Scanning a gray-scale document typically yields a multi-bit, typically eight-bit, value for each pixel in the scanned
document. The value represents the luminance, in terms of a 256-level gray scale, of a pixel at a corresponding point
in the document. These pixels are generated, depending upon the resolution of the scanner, frequently at resolutions
of 200-400 pixels/inch (approximately 80-160 pixels/cm), though with highly detailed images at upwards of 1200 or more
pixels/inch (approximately 470 pixels/cm). Consequently, a scanned 8 1/2 by 11 inch (approximately 22 by 28 cm) image
will contain a considerable amount of gray-scale data. Inasmuch as scanned text generally presents written or printed
characters of some sort against a contrasting colored background, typically white or black print against a white or light
colored background, or vice versa, the exact luminance value at any one pixel in the text is not as important as whether
that pixel is either part of a character or the background. Therefore, scanned textual images, or scanned textual portions
of larger images containing both text and graphics, can be efficiently represented by single-bit pixels in which each pixel
in a scanned image is simply set to, e.g., a "one" if that pixel in the original image is part of a character or part of
foreground information, or to, e.g., a "zero" if that pixel in the original image is part of the image background. To easily
distinguish the different types of scanned images, a gray-level image is defined as one having multi-bit (hence
multi-value) pixels, whereas a binary (or bi-level) image is formed of single-bit pixels. Furthermore, since binary images
generate considerably less data for a given textual image, such as, e.g., one-eighth as much as for an eight-bit gray-scale
rendering of the same image, binary images are more efficient than corresponding gray-scale images and thus preferred
for storage and communication of textual images. Binary images are also preferred because of their easy compressibility
using standard compression techniques, e.g., CCITT Group 3 or 4 compression standards.

Gray-scale images are converted to binary images through a so-called thresholding process. In essence, each multi-bit
pixel value in a gray-scale scanned image is compared to a pre-defined threshold value, which may be fixed, variable
or even adaptively variable, to yield a single corresponding output bit. If the multi-bit pixel value equals or exceeds the
threshold value for that particular pixel, the resultant single-bit output pixel is set to a "one"; otherwise, if the threshold is
greater than the multi-bit pixel, then the resultant single-bit output pixel remains at "zero". In this manner, thresholding
extracts those pixels, such as those which form characters, or other desired objects, from the background in a scanned
gray-scale image, with the pixels that form each character, or object, being one value, typically that for black, and the
pixels for the background all being another value, typically that for white. For ease of reference, we will hereinafter
collectively refer to each character or other desired object in the image as simply an "object".
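
As a minimal illustration of the fixed-threshold comparison described above, the following Python sketch maps each multi-bit pixel to a single bit. The function name `threshold_fixed` and the sample values are assumptions made for illustration only and do not appear in the specification.

```python
def threshold_fixed(image, threshold):
    """Return a binary image: 1 where the pixel equals or exceeds the
    threshold, 0 otherwise (hypothetical helper, not from the source)."""
    return [[1 if value >= threshold else 0 for value in row] for row in image]


# Illustrative eight-bit gray-scale values.
gray = [[12, 200, 128],
        [127, 255, 0]]
binary = threshold_fixed(gray, 128)
```

A single fixed threshold of this kind is the baseline that the remainder of the background discussion finds inadequate for real scanned images.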

Ideally, the best thresholding process is one which accurately selects all the object pixels, but nothing more, in the
scanned image and maps those pixels to a common single-bit value, such as, e.g., "one" for black. In practice, noise,
background shading, lighting non-uniformities in a scanning process and other such phenomena preclude the use of a
single fixed threshold for an entire image. In that regard, if the threshold is too low, the resulting image may contain an
excessive amount of noise in certain, if not all, regions; or, if too high, insufficient image detail, again in certain, if not all,
regions - thereby complicating the subsequent processing of this image. Given this, the art recognizes that a preferred
approach would be to select a different threshold value that is appropriate to each and every pixel in the scanned image.
In doing so, the proper threshold value is determined based upon local properties of the image, i.e., certain image
characteristics that occur in a localized image region for that pixel. Hence, the threshold would vary across the image,
possibly even adapting to changing localized image conditions.

In general, a common methodology for variable thresholding relies on measuring localized image characteristics,
such as local intensity contrast (or gradient), local averaged intensity and/or local variance, in a local window centered
about a pixel of interest and then using these measures to classify image pixels into either object pixels, black, or
background pixels, white. Here, too, reality diverges from the ideal inasmuch as this methodology is complicated, and often
frustrated, by a need to extract various objects in a wide range of documents but with minimal user intervention, such
as for purposes of initialization and object identification, and while still yielding a clean background in the thresholded
image. In reality, these objects may include, e.g., dim, broken objects; and objects that present a relatively low contrast,
[...] objects embedded in a black background.

Given the overwhelming inability of fixed thresholding to provide adequate performance with actual
images, the art has persisted by teaching several variable thresholding approaches that attempt to provide satisfactory
performance. However, [...] drawbacks that, in practice, tend to limit their utility.

[...] different localized image properties, are taught in M. Kamel et al., "Extraction of [...] from Grayscale
Document Images", CVGIP: Graphical Models and Image Processing, [...] 1993, pages 203-217. Here, a "logical level"
approach is based on comparing a [...] level (if the image is noisy) with four local averages in neighborhoods
centered about four pixels [...]. [...] approach, so-called "mask-based subtraction", relies
on considering every pixel [...] background image and a character/graphics image. First, most of
[...] filter [...] function, to remove "particle" noise. The filter is applied [...]
of 0, π/4, π/2 and [...]. The resulting filtered binary image contains character/graphics pixels which are black and
background pixels [...]. Thereafter, this filtered, or "mask", image is modified by detecting additional
background pixels [...] width and then, for every possible character/graphics pixel, the gray level
of its [...]. Lastly, a gray-scale character/graphics image [...] background image from the original
scanned image with resulting differences then [...] to yield a binary character/graphics image. Though the global
threshold value [...], the difference between this threshold and a varying background value essentially
[...]. Though, at first blush, these two approaches would appear to be somewhat immune
[...], each of these approaches is highly sensitive to noise and often results in a noisy background when
[...].

Another approach, [...] 19, 1989 and owned by the assignee hereon, relies on tracking a background value in an
image, with a threshold value [...] and a feedback signal. Here, whenever a transition
[...] an edge, the feedback signal is momentarily varied in a pre-defined pattern to momentarily
modify the threshold value such that, ostensibly, an output filtered thresholded pixel value has a reduced noise content.
[...] exhibits boundary artifacts at intensity transitions due to abrupt changes
[...] difficulties in detecting low contrast objects.

A further approach is described in [...] 4,468,704 (issued to J.C. Stoffel et al on August 28, 1984).
Here, adaptive thresholding is [...] image offset potential, which is obtained on a pixel-by-pixel
[...] in the image. This offset potential is used in conjunction
with nearest neighbor pixels [...] by comparing the image potential of that pixel with predetermined minimum
[...]. Unfortunately, [...] also appears to exhibit difficulties in
extracting low contrast objects.

Therefore, a need exists in the art for a technique, specifically apparatus and an accompanying method, for
accurately and reliably thresholding [...] objects therein using a threshold that varies based on local
[...] noise immunity and reduced boundary artifacts, as well as
[...] objects, than do conventional variable thresholding techniques.

DISCLOSURE OF THE INVENTION

We have substantially and advantageously overcome the deficiencies in the art through our inventive multi-window
thresholding technique.

Specifically, a localized intensity gradient, G(i,j), is determined for each pixel(i,j) in an incoming
scanned gray-scale bit-mapped image. This gradient is determined for a window, illustratively 3-by-3 pixels, centered
about pixel(i,j). Next, minimum intensity, Lmin, and maximum intensity, Lmax, are detected for an N-by-N pixel window
centered over the image about pixel(i,j). A localized area gradient, GS(i,j), is also determined as a sum of individual
intensity gradients for a matrix of pixel positions centered about position (i,j).
Using these local measures, each image pixel(i,j) is first classified as being near an edge or not. In that regard, each
pixel near an edge is so classified if the local measures therefor, specifically its associated area gradient, exceeds a
pre-defined threshold value. Otherwise, that pixel is viewed as falling within a locally flat field, i.e., of relatively constant
gray-scale. Specifically, edge pixels are classified by comparing their intensity values against average intensity values
for local N-by-N pixel windows therefor; in contrast, pixels in flat fields are classified by comparing their intensity against
a predetermined intensity value.

In particular, an image pixel(i,j) located in a vicinity of an edge in the image is detected whenever the area gradient,
GS(i,j), for that pixel is high, particularly being larger than a predefined gradient threshold, GT. Once such a pixel is
found, each pixel on a darker side of the edge is found. This latter pixel, being in the center of a local N-by-N, e.g.,
7-by-7, pixel window, is classified as an object pixel, i.e., black, whenever its intensity, Lc, is smaller than an average
of its associated Lmin and Lmax values. Alternatively, if the intensity of this latter pixel, Lc, exceeds the average intensity
value of Lmin and Lmax, then this particular pixel is classified as being background, i.e., white.

Where the area gradient, GS(i,j), for pixel(i,j) is less than the gradient threshold, such as where the N-by-N window
slides along an image region of relatively constant gray-scale, pixel(i,j) is classified by simple thresholding, i.e.,
thresholding its gray-scale value against a constant predefined threshold value, IT. Here, if the gray-scale value of
pixel(i,j) is less than the threshold value IT, this pixel is classified as an object pixel, i.e., black; otherwise, this pixel is
designated as part of the background, i.e., white.
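
The two-branch classification just summarized can be sketched in Python as follows. This is only an illustrative reading of the text, not the patented implementation: the function name `classify_pixel`, its parameter names, and the handling of ties (treated here as object pixels) are assumptions.

```python
BLACK = 0  # object pixel
WHITE = 1  # background pixel


def classify_pixel(lc, gs, l_min, l_max, it, gt):
    """Classify one pixel from its intensity lc, area gradient gs and
    local extremes l_min, l_max, using fixed thresholds it and gt."""
    if gs > gt:
        # Near an edge: the threshold is the average of the local
        # minimum and maximum intensities.
        l_avg = (l_min + l_max) / 2
        return BLACK if lc <= l_avg else WHITE  # tie handling is an assumption
    # Flat field: simple thresholding against the fixed value IT.
    return BLACK if lc <= it else WHITE  # tie handling is an assumption


# Illustrative values: IT = 128, GT = 300.
edge_dark = classify_pixel(100, 500, 50, 200, 128, 300)
edge_bright = classify_pixel(150, 500, 50, 200, 128, 300)
flat_dark = classify_pixel(100, 10, 50, 200, 128, 300)
flat_bright = classify_pixel(150, 10, 50, 200, 128, 300)
```

The point of the design is that the threshold used near an edge adapts to the local intensity range, while flat regions fall back to a single fixed value.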

To enhance image edges and by doing so further increase thresholding accuracy, the intensity values of those pixels
near an edge are modified. For the preferred embodiment, black pixels are illustratively represented by a value of zero
and white pixels by a value of "1". Specifically, for those pixels located on a darker side of an edge and having an intensity
value that exceeds the threshold value, IT, the intensity of each of these pixels is reduced to a value slightly below
threshold IT, e.g., to a value IT-. In contrast, the intensity values for those pixels located on a brighter side of an edge
and having an intensity value less than threshold value IT are increased to an intensity value slightly greater than
threshold value IT, e.g., to value IT+. A high quality binary image can then be produced by merely thresholding the resulting
gray-scale image against pre-defined threshold value IT.

Advantageously, through the use of area (summed intensity) gradients, the present invention significantly reduces
both background noise and boundary artifact generation while accurately extracting objects in a thresholded image. The
modified gray-scale image, when subsequently thresholded using a fixed threshold value IT, greatly enhances detection
of low contrast objects and thus readily produces a high quality binary image.
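
The edge-enhancing modification of gray-scale values might be sketched as below. The names `modify_gray` and `delta` are hypothetical, and using IT - 1 and IT + 1 for the values IT- and IT+ is an assumption; the text only requires values slightly below and slightly above IT.

```python
def modify_gray(lc, gs, l_min, l_max, it, gt, delta=1):
    """Return the modified gray-scale value for one pixel.

    delta is a hypothetical offset giving IT- = it - delta and
    IT+ = it + delta.
    """
    if gs <= gt:
        return lc  # flat field: value passes through unchanged
    l_avg = (l_min + l_max) / 2
    if lc <= l_avg:
        # Darker side of an edge: pull values above IT just below it.
        return it - delta if lc > it else lc
    # Brighter side of an edge: push values below IT just above it.
    return it + delta if lc < it else lc


# Illustrative values: IT = 128, GT = 300.
darker = modify_gray(140, 500, 100, 220, 128, 300)
brighter = modify_gray(120, 500, 60, 160, 128, 300)
flat = modify_gray(140, 10, 100, 220, 128, 300)
```

After this modification, a plain comparison against IT places the darker-side pixel on the object side and the brighter-side pixel on the background side, which is why a simple fixed threshold suffices downstream.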

BRIEF DESCRIPTION OF THE DRAWINGS

The teachings of the present invention may be readily understood by considering the following detailed description
in conjunction with the accompanying drawings, in which:

FIG. 1 depicts a high-level block diagram of document imaging system 5 that embodies the teachings of our present
invention;

FIG. 2 depicts the correct alignment of the drawing sheets for FIGs. 2A and 2B;

FIGs. 2A and 2B collectively depict a high-level flowchart of our inventive multi-windowing thresholding method;

FIG. 3A depicts a 3-by-3 matrix of pixel location definitions which we use in determining, through a "Sobel" gradient
operator, gradient strength for current pixel(i,j);

FIG. 3B depicts a graphical representation of an N-by-N neighborhood of pixels centered about a current pixel(i,j)
in an image intensity record;

FIG. 3C depicts a graphical representation of an (N-2)-by-(N-2) neighborhood of pixels centered about pixel position
(i,j) in an image intensity gradient record;

FIG. 4 graphically depicts intensity profile 400 of an illustrative scan line of an original gray-scale image, such as
that illustratively on document 10 shown in FIG. 1;

FIG. 5 graphically depicts a modified intensity profile which results after thresholding intensity profile 400 shown in
FIG. 4 according to the teachings of our present invention;

FIG. 6 depicts a block diagram of a preferred embodiment of our present invention; 
FIG. 7 depicts a block diagram of Sum of Gradients Circuit 620 which forms part of circuit 600 shown in FIG. 6; and

FIG. 8 depicts a block diagram of 7-by-7 Maximum and Minimum Detector 635 which also forms part of circuit 600
shown in FIG. 6.

To facilitate understanding, identical reference numerals have been used, where possible, to denote identical
elements that are common to various figures.

MODES OF CARRYING OUT THE INVENTION

After reading the following description, those skilled in the art will quickly realize that our invention is applicable for
use in nearly any document imaging system for accurately thresholding scanned documentary images containing line
art, regardless of what that line art contains and the media upon which the image originally appears. For example,
[...] letters and numbers, which we will collectively refer to hereinafter as "objects".

For purposes of illustration and discussion, we define black and white gray-scale pixels as having eight-bit gray-scale
values of "0" and "255", respectively. For consistency, we also define output binary pixels with black being "0" and
[...] inequalities that compare pixel intensities, as would be readily apparent to those skilled in the art, reversed from those
[...].

FIG. 1 depicts a high-level block diagram of document imaging system 5 that embodies the teachings of our present
invention. As shown, system 5 is formed of gray-scale scanner 20, digital image processor 30 and "simple" thresholding
circuit 40. In operation, document 10 containing various objects of interest is scanned by scanner 20 to produce
multi-bit [...] data routed, via leads 25, to a data input of digital
image processor 30 which, in turn and to the extent relevant here, thresholds the scanned image data and generates both
a modified gray-scale image on leads 37 and an output binary image on leads 33. The modified gray-scale image, as
described in greater detail below, has its edges enhanced by processor 30 to subsequently increase thresholding
accuracy. The modified gray-scale image data appearing on leads 37 is routed to an input of thresholding circuit 40 which
[...] pre-defined fixed threshold value thereby
[...], via leads 45, another output binary image. Digital image processor 30 performs thresholding in accordance with our
[...] data throughputs required for processing bit-mapped images, processor 30 is preferably
[...] circuitry, including our inventive circuit 600 shown in FIG. 6 and discussed in
[...]. Alternatively, this processor can be implemented through one or more general purpose microprocessors,
with associated memory and supporting circuitry and suitably programmed to implement our present invention in
software, [...] that the microprocessor(s) could execute instructions sufficiently fast to produce the requisite data
throughput.

[...] multi-windowing thresholding method 200 is collectively depicted in FIGs. 2A
and 2B, for which the correct alignment of the drawing sheets for these figures is shown in FIG. 2. During the course of
[...], reference will also be made to FIGs. 3A-3C where appropriate.

Method 200 is performed for each and every gray-scale pixel in an incoming image, thereby iterating through the
image. Since the manner through which each gray-scale image pixel is processed is identical across all such pixels, we
will merely discuss this routine in the context of processing generalized pixel(i,j), where i and j are positive integers and
define pixel horizontal and vertical indices within the incoming gray-scale image. In addition to gray-scale image data,
[...] values of two input parameters, specifically fixed thresholds IT and GT. These thresholds are adjusted,
as discussed below, either to provide satisfactory thresholding across a multitude of differing images with varying
characteristics, such as contrast and illumination, or can be adjusted to provide optimum thresholding for any one type of
[...] performing method 200 on all such gray-scale image pixels,
[...] image will be generated, with each pixel in the resulting image corresponding
to a pixel in the incoming [...] a predefined threshold, having a value IT, as that value
[...] gray-scale image can then be further processed, e.g., through scaling or filtering, as
desired.

In particular, upon starting method 200, step 203 is first performed. This step determines a gradient strength for
pixel(i,j). This is accomplished by use of the so-called "Sobel" gradient operator on a window of pixels centered about
[...] in equations (1)-(3) below, the Sobel operator relies on computing horizontal and vertical pixel intensity gradients,
GX(i,j) and GY(i,j), respectively, and for each pixel position (i,j) forming the gradient strength, G(i,j), as an absolute sum
of GX(i,j) and GY(i,j):

GX(i,j) = L(i+1,j-1) + 2L(i+1,j) + L(i+1,j+1) - L(i-1,j-1) - 2L(i-1,j) - L(i-1,j+1)     (1)
GY(i,j) = L(i-1,j+1) + 2L(i,j+1) + L(i+1,j+1) - L(i-1,j-1) - 2L(i,j-1) - L(i+1,j-1)     (2)
G(i,j) = |GX(i,j)| + |GY(i,j)|                                                          (3)

where: G(i,j) is the gradient strength at pixel position (i,j); and

L(i,j) is the image intensity, luminance, at pixel position (i,j).

The resulting gradient strength value for every image pixel collectively forms a gradient strength record for the entire
scanned image.
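
A minimal Python sketch of equations (1)-(3) follows. It assumes the image is stored as a nested list indexed `L[i][j]` and that (i,j) is an interior position; the function name `gradient_strength` and the treatment of image borders are not specified in the text and are assumptions here.

```python
def gradient_strength(L, i, j):
    """Sobel gradient strength G(i,j) = |GX| + |GY| at an interior pixel.

    L is indexed as L[i][j]; no border handling is attempted.
    """
    gx = (L[i + 1][j - 1] + 2 * L[i + 1][j] + L[i + 1][j + 1]
          - L[i - 1][j - 1] - 2 * L[i - 1][j] - L[i - 1][j + 1])
    gy = (L[i - 1][j + 1] + 2 * L[i][j + 1] + L[i + 1][j + 1]
          - L[i - 1][j - 1] - 2 * L[i][j - 1] - L[i + 1][j - 1])
    return abs(gx) + abs(gy)


# A step edge along the first index, and a perfectly flat patch.
step = [[0, 0, 0],
        [0, 0, 0],
        [255, 255, 255]]
flat = [[90, 90, 90],
        [90, 90, 90],
        [90, 90, 90]]
g_step = gradient_strength(step, 1, 1)
g_flat = gradient_strength(flat, 1, 1)
```

Using the sum of absolute values rather than a Euclidean magnitude avoids a square root, which suits the hardware realization described later in the specification.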

Once the gradient strength, G(i,j), is determined for pixel(i,j), method 200 advances to step 206. This step, when
performed, determines the area gradient for pixel position (i,j), i.e., GS(i,j), as the sum of the gradient strengths for each
and every pixel position within an (N-2)-by-(N-2) window centered about pixel position (i,j). This window is illustratively
5-by-5 pixels in size and is shown, as window 370, in FIG. 3C for an N-by-N, illustratively 7-by-7, window shown in FIG.
3B, the latter being window 330, centered about pixel(i,j). For purposes of comparison, the periphery of an N-by-N
window is shown as window 350 in FIG. 3C. Though shown sequentially to simplify illustration, step 245 is generally
performed at essentially the same time, as symbolized by dashed line 244, as are steps 203 and 206. Step 245 determines
both minimum and maximum pixel intensity values, Lmin and Lmax, respectively, in an N-by-N window centered about
pixel(i,j).

Once the area gradient is determined for pixel position (i,j), then, through the remainder of method 200, the three
image measures, GS(i,j), Lmin and Lmax, associated with this particular pixel position are used to classify pixel(i,j)
as an object pixel, i.e., black, or a background pixel, i.e., white.
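
The two window measures described above can be sketched in Python as follows. The function names `area_gradient` and `local_min_max` are hypothetical, both records are assumed to be nested lists, and the window is assumed to lie entirely inside the image since border handling is not addressed in the text.

```python
def area_gradient(G, i, j, n=7):
    """Sum the gradient strengths in the (n-2)-by-(n-2) window of the
    gradient record G centered at (i, j)."""
    half = (n - 2) // 2
    return sum(G[r][c]
               for r in range(i - half, i + half + 1)
               for c in range(j - half, j + half + 1))


def local_min_max(L, i, j, n=7):
    """Return (Lmin, Lmax) over the n-by-n window of the intensity
    record L centered at (i, j)."""
    half = n // 2
    window = [L[r][c]
              for r in range(i - half, i + half + 1)
              for c in range(j - half, j + half + 1)]
    return min(window), max(window)


# Illustrative 7-by-7 records.
G = [[1] * 7 for _ in range(7)]
L = [[r * 7 + c for c in range(7)] for r in range(7)]
gs = area_gradient(G, 3, 3)
l_min, l_max = local_min_max(L, 3, 3)
```

With the illustrative N of 7, the gradient sum covers 25 positions while the minimum and maximum are taken over all 49 pixels of the larger window.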

In particular, decision step 208 is performed to assess whether the value of the area gradient for pixel(i,j), i.e., GS(i,j),
exceeds a pre-defined threshold value, GT, or not. This test determines whether pixel(i,j) lies in a vicinity of an edge in
the scanned image, or not. If the area gradient is less than the threshold value, GT, then pixel(i,j) does not lie near an
edge. Hence, pixel(i,j) lies within a localized relatively constant tone, i.e., "flat" field, region of the scanned gray-scale
image. In this case, method 200 advances along NO path 211 emanating from decision block 208 to decision block 215.
This latter decision block determines whether the gray-scale intensity of pixel(i,j), i.e., Lc, exceeds another pre-defined
threshold value, IT. If this gray-scale value is less than the threshold, the pixel is classified as an object pixel, i.e., here
black; otherwise, the pixel is classified as a background pixel, i.e., here white. Specifically, on the one hand, if pixel
intensity, Lc, is less than or equal to threshold IT, decision block 215 advances method 200, along NO path 217, to
decision block 222. This latter decision block determines whether a user has previously instructed the method to produce
a binary or gray-scale output pixel. If a binary pixel is desired, then decision block 222 advances the method, via YES
path 225, to block 228 which, in turn, sets an intensity of output pixel(i,j) to black, i.e., zero. Alternatively, if a gray-scale
output is desired, then decision block 222 advances method 200, via NO path 224, to block 230. This latter block, when
performed, sets the intensity of output pixel(i,j) to the intensity Lc, without any modifications made thereto. If, on the
other hand, pixel intensity, Lc, exceeds threshold IT, decision block 215 advances method 200, along YES path 218, to
decision block 232. This latter decision block determines whether a user has previously instructed the method to produce
a binary or gray-scale output pixel. If a binary pixel is desired, then decision block 232 advances the method, via YES
path 235, to block 237 which, in turn, sets an intensity of output pixel(i,j) to white, i.e., one. Alternatively, if a gray-scale
output is desired, then decision block 232 advances method 200, via NO path 234, to block 241. This latter block, when
performed, sets the intensity of output pixel(i,j) to the intensity Lc, without any modifications made thereto. Once step
228, 230, 237 or 241 is performed, method 200 is completed for pixel(i,j); the method is then repeated for the next image
pixel in succession, and so forth.

Alternatively, if area gradient, GS(i,j), exceeds the threshold value, GT, then pixel(i,j) lies in the vicinity of an edge.
In this case, step 245 is now performed to ascertain the values of Lmax and Lmin, if these values have not already been
determined, as set forth above. Thereafter, method 200 advances to block 248 which calculates an average pixel intensity
value, Lavg, occurring within the N-by-N pixel window centered about pixel position (i,j). This average intensity value is
simply determined by averaging the values Lmax and Lmin.

Once this average value has been ascertained, method 200 advances to decision block 251 which compares the
intensity of pixel(i,j), i.e., Lc, against its associated average intensity value, Lavg. When this pixel is situated on a darker
side of an edge and hence an object pixel, then its associated average intensity value will be greater than or equal to
its intensity value, Lc. In this case, the output binary pixel for pixel position (i,j) will be set to black. Otherwise, if pixel(i,j)
is situated near an edge but its average intensity value, Lavg, would be less than its intensity value, Lc, then this pixel is
[...] increase thresholding accuracy.

In particular, if gray-scale [...] associated average intensity value,
Lavg, then decision block 251 advances method 200, along a NO path, [...]
determines whether a user has previously instructed the method to produce a binary or gray-scale output pixel. If a
binary pixel is desired, then decision block 260 advances the method, via YES path 265, to block 268 which, in turn,
sets an intensity of output pixel(i,j) to black, i.e., zero. However, if a gray-scale [...], decision block 260
advances method 200, via NO path 262, to a decision block [...] which, when performed, determines whether
pixel intensity, Lc, [...] is less than or equal to threshold
IT, then method 200 advances, along a NO path, to block 275 which, when executed, sets the gray-scale output for
[...] threshold IT, then method
[...] value that is slightly less, i.e., IT-, than threshold IT.

Alternatively, if gray-scale [...] associated average intensity value, Lavg, then decision block
251 advances method 200, along a YES path, [...]. This latter decision block determines whether
a user has previously instructed the method to produce a binary or gray-scale output pixel. If a binary pixel is desired,
then decision block 280 advances the method [...] to block 285 which, in turn, sets an intensity of output
pixel(i,j) to white, i.e., one. However, if [...], then decision block 280 advances method 200,
via NO path 284, to decision block 290. This latter block, when performed, determines whether pixel intensity, Lc, is less
than pre-defined threshold value IT. If the intensity of pixel(i,j) [...] to threshold IT, then method 200
advances, along a NO path, [...] the gray-scale output for pixel(i,j) to the pixel
intensity value Lc. Now, if the intensity of pixel(i,j) is less than [...], method 200 advances, along YES path
294, to block 297 which, when executed, sets the gray-scale output for pixel(i,j) to a value that is slightly larger, i.e., IT+,
[...] threshold IT is not critical
[...] been performed for pixel(i,j), method 200 is
then repeated for the next image pixel [...].

With this understanding, [...] depicts intensity profile 400 of an illustrative scan
line of an original gray-scale image, such as that illustratively on document 10 shown in FIG. 1. As a result of processing
profile 400 through our inventive method, modified intensity profile 500, shown in FIG. 5, results. As is readily apparent
from comparing profiles 400 and 500, [...].
In this regard, when such a pixel is located on a darker side of an edge and its intensity is larger than threshold
value IT, the intensity of this pixel is decreased to a value, IT-, slightly below IT. Alternatively, when an image
[...]
[...] averaging circuit 640; multiplexors 680 and 690; [...]
675. Circuit 600 collectively implements method 200 depicted in FIGs. 2A and 2B and discussed in detail above.

In operation, eight-bit input [...] line delays 610 and to an input of 7-by-7 Max and Min detector 635.
[...] specifically, series-connected single
line delay elements 610_1, 610_2 and 610_3 [...] centered around image pixel(i,j).
Each delay element is tapped at three [...] line of the window. The outputs of all three
delay elements are routed to respective inputs of a gradient circuit which, through calculation of the Sobel
operator, as described above, provides an intensity gradient [...] G(i,j) on output leads 617. This
gradient value is applied to an input [...] the gradient intensity values
in a 5-by-5 window centered about pixel position (i,j) [...] as an area gradient value, i.e.,
GS(i,j), on 12-bit output leads 622. Circuit 620 [...] discussed in detail below. This area gradient value
is applied to an input [...] against pre-defined gradient threshold, GT,
the latter being applied to another input ("B") [...]. The comparison output, appearing on
[...] or not. The comparison output is applied as an
[...]
gray-scale output, i.e., Lc or one of the predefined values IT- or IT+.
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Max and Min detector 635. depicted in FIG. 8 and discussed in detail below, ascertains the maximum and minimum 
pixel intensity values, i.e.. L max and L min . of those pixels contained within a 7-by-7 window centered about pixelfl j). 
Resulting L and L min values, appearing on leads 637 and 639. are then averaged by averaging circuit 640. This circuit 
contains summer 642, which merely adds these two values together into a resultant sum. and 12 and delay circuit 646 
which implements a divide by two operation by shifting the sum one-bit to the right; hence dropping the least significant 
bit The resultant averaged intensity value is then appropriately delayed within circuit 646 for purposes of achieving 
proper synchronization within circuit 600. 
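
The add-then-shift averaging performed by circuit 640 can be mirrored in software as in the sketch below. The function name `average_min_max` is an assumption; the delay used for synchronization has no software counterpart and is omitted.

```python
def average_min_max(l_min, l_max):
    """Average two eight-bit values by summing them and shifting the
    sum one bit to the right, which drops the least significant bit."""
    total = l_min + l_max  # at most 510, so the sum needs nine bits
    return total >> 1


avg_even = average_min_max(50, 200)
avg_odd = average_min_max(50, 201)
avg_full = average_min_max(0, 255)
```

Because the least significant bit is discarded, an odd sum is rounded down rather than to the nearest integer.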

If the area gradient value, GS(i,j), exceeds the pre-defined gradient threshold, GT, then the averaged intensity value,
L avg, is used as a threshold level in determining the value of output center pixel(i,j); else, if the area gradient value is less
than or equal to the gradient threshold, then value IT is used as the threshold level in determining the value of output
center pixel(i,j). In this regard, the averaged intensity value, L avg, is applied to one input of comparator 650. The current
center pixel value, Lc, is applied through delay element 630 to another input of comparator 650 as well as to one input
of comparator 655. The delay of element 630 is set to assure that the proper corresponding values of Lc and L avg are
synchronously applied, collectively, to comparators 650 and 655, multiplexors 680 and 690, and gating circuit 660. The
fixed threshold value, IT, is applied to another input of comparator 655. Comparator 650 produces a low or high
level output on lead 653 if pixel intensity value Lc is less than, or equal to or greater than, respectively, its associated
averaged intensity value, L avg. This output level is applied to both one input ("1") of multiplexor 680 and one input of
AND gate 662. Multiplexor 680 selects the output binary pixel as between two bits generated from using two different
thresholds, i.e., value IT or L avg. In that regard, comparator 655 produces a high output level at its A>B output whenever
the predefined threshold value IT is greater than pixel intensity value Lc. The level at output A>B is routed, via leads
657, to another input ("0") of multiplexor 680 and to one input of AND gate 670. Multiplexor 680 produces the output
binary value for pixel(i,j). To do so, the comparison output level produced by comparator 625, indicative of whether the
area gradient, GS(i,j), exceeds the gradient threshold, GT, is applied as a select signal, via lead 628, to a select ("S")
input of multiplexor 680. If this select signal is high, thereby indicating that the area gradient exceeds the gradient
threshold, then multiplexor 680 routes the level applied to its "1" input, i.e., that generated by comparator 650, to binary output
lead 33; else, the multiplexor routes the level then appearing at its "0" input to lead 33.
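
The threshold selection performed by multiplexor 680 can be sketched as follows. This models only the decision rule, not the signal polarities on leads 653 and 657. The function name, the 0/1 coding and the convention that a pixel darker than its threshold is the object (black) pixel are assumptions made for illustration.

```python
BLACK, WHITE = 0, 1   # assumed output coding; the text speaks only of signal levels


def binary_output(lc: int, l_avg: int, gs: int, gt: int, it: int) -> int:
    """Classify the center pixel using either L avg or IT as the threshold."""
    # Near an edge (area gradient above GT) use the local average of L max and L min;
    # in flat regions fall back to the fixed threshold IT.
    threshold = l_avg if gs > gt else it
    # Assumption: a pixel below its threshold is treated as an object (black) pixel.
    return BLACK if lc < threshold else WHITE


# Hypothetical values: the same pixel is classified differently near an edge.
print(binary_output(lc=120, l_avg=140, gs=500, gt=300, it=100))  # 0 (black, edge region)
print(binary_output(lc=120, l_avg=140, gs=100, gt=300, it=100))  # 1 (white, flat region)
```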

Multiplexor 690 routes the center pixel value, Lc, or a predefined value IT+ or IT- as the output gray-scale value to
eight-bit output leads 37. Pre-defined values IT+ and IT-, along with center pixel value Lc, are applied to different
corresponding eight-bit inputs to multiplexor 690. The output of this multiplexor is determined by the state of two select signals,
S0 and S1: if these signals assume the binary values zero, "1" or "3", then the multiplexor will route center pixel value
Lc, value IT+ or value IT-, respectively, to output leads 37. Gating circuit 660, in response to the comparison output
signals produced by comparators 625, 650 and 655, generates, by simple combinatorial logic, the two select signals, S0
and S1, applied to multiplexor 690. Specifically, if the area gradient is not larger than the gradient threshold, then, as
discussed above, gray-scale pixel value Lc, without any modification, will be applied through multiplexor 690 to output
leads 37. Alternatively, if the area gradient exceeds the gradient threshold, then multiplexor 690 will apply either value
IT+ or IT- as a modified gray-scale output value to leads 37 based upon whether center pixel intensity value Lc is less
than its corresponding average pixel intensity value, L avg, or not. Since the operation of gating circuit 660 is self-evident
from the above description to anyone skilled in the art, it will not be discussed in any further detail. Values IT- and IT+
are merely held in appropriate registers (not shown) and from there applied to corresponding inputs of multiplexor 690.
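
A minimal sketch of the gray-scale selection made by multiplexor 690 follows. The text states only that IT+ or IT- is chosen according to whether Lc is less than L avg; the mapping used here (IT- when Lc is below L avg, IT+ otherwise) is an assumption. So are the function names, the numeric values and the "black below threshold" convention of the downstream check.

```python
def modified_gray(lc: int, l_avg: int, gs: int, gt: int,
                  it_plus: int, it_minus: int) -> int:
    """Return Lc unchanged in flat regions, or IT+ / IT- near an edge."""
    if gs <= gt:
        return lc                     # area gradient not above GT: pass Lc through
    # Assumed mapping: darker-than-average edge pixels get IT-, the rest get IT+.
    return it_minus if lc < l_avg else it_plus


def simple_threshold(value: int, it: int) -> int:
    """Hypothetical downstream fixed threshold: 0 (black) below IT, else 1 (white)."""
    return 0 if value < it else 1


# Hypothetical values with IT- < IT < IT+.
IT, IT_PLUS, IT_MINUS = 100, 110, 90
gray = modified_gray(lc=120, l_avg=140, gs=500, gt=300, it_plus=IT_PLUS, it_minus=IT_MINUS)
print(gray, simple_threshold(gray, IT))  # 90 0
```

Under these assumptions, a pixel that a fixed threshold alone would call white (120 is above 100) is forced below IT, so a later fixed-threshold stage still classifies it as black.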

FIG. 7 depicts a block diagram of Sum of Gradients Circuit 620 which forms part of circuit 600 shown in FIG. 6.
Circuit 620, as shown in FIG. 7, is formed of adder 710 and line delays 720, the latter containing four series-connected
single-line delay elements 720₀, 720₁, 720₂ and 720₃. Incoming gradient intensity values are applied, over lead 617
from gradient detection circuit 615 (shown in FIG. 6), to one input of adder 710, shown in FIG. 7. In addition, the output
of each of the four line delays 720 is routed over a different one of four feedback leads 715 to a different corresponding
input of this adder. As a result, adder 710 forms partial sums of 1-by-5 gradient values which are applied, as input, to
delay element 720₀. To generate a 5-by-5 sum of the intensity gradients, hence producing an area gradient measure,
four preceding corresponding 1-by-5 sums are added together within adder 710 to a present 1-by-5 sum, with the resultant
overall 5-by-5 sum being applied to output lead 622. Use of these delay elements connected in this fashion substantially
reduces the amount of logic that would otherwise be needed to generate twenty-five intermediate sums.
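
The two-stage idea, forming 1-by-5 sums first and then combining five of them, can be illustrated in software. This sketch is not a model of adder 710 or the line delays. It uses a window centered on each position and skips positions near the border, both of which are simplifying assumptions, and the function name is hypothetical.

```python
def area_gradient_sums(g, half=2):
    """Sum gradient values over a (2*half+1)-square window at each interior position.

    g is a list of equal-length rows of gradient values G(i,j); positions closer
    than `half` to a border are skipped (an assumption made for this sketch).
    """
    rows, cols = len(g), len(g[0])
    centers = range(half, cols - half)
    # Stage 1: a 1-by-5 horizontal sum for every row and every interior column.
    row_sums = [[sum(row[c - half:c + half + 1]) for c in centers] for row in g]
    # Stage 2: add five vertically adjacent 1-by-5 sums to obtain the 5-by-5 sum.
    out = {}
    for r in range(half, rows - half):
        for idx, c in enumerate(centers):
            out[(r, c)] = sum(row_sums[r + d][idx] for d in range(-half, half + 1))
    return out


gradients = [[1] * 6 for _ in range(6)]       # flat 6-by-6 field of unit gradients
print(area_gradient_sums(gradients)[(2, 2)])  # 25: twenty-five ones in the window
```

Reusing the row sums means each 5-by-5 result needs five additions of stored values rather than twenty-five additions of raw gradients.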

FIG. 8 depicts a block diagram of 7-by-7 Maximum and Minimum Detector 635 which also forms part of circuit 600
shown in FIG. 6. As noted above, detector 635 determines the maximum and minimum pixel intensity values, L max and
L min, respectively, within a 7-by-7 window centered about pixel position (i,j). This detector is formed of two separate
circuits: maximum detecting circuit 805 and minimum detecting circuit 840, which extract a pixel value having a maximum
and minimum intensity value, respectively, of all the pixels contained within the 7-by-7 window centered around pixel(i,j)
and respectively apply those two values to leads 637 and 639. Circuit 805 contains maximum detector 810 and line
delays 820, the latter being formed of six series-connected single-line delay elements 820₀, 820₁, 820₂, 820₃, 820₄ and
820₅. Similarly, circuit 840 contains minimum detector 850 and line delays 860, the latter being formed of six series-connected
single-line delay elements 860₀, 860₁, 860₂, 860₃, 860₄ and 860₅. Inasmuch as circuits 805 and 840 function
in an identical manner, with the exception that detector 810 locates maximum values while detector 850 locates minimum
values, both from the same series of pixel intensity values applied over lead 608, we will only discuss circuit 805 in detail;
those skilled in the art will then readily appreciate how circuit 840 functions from this discussion.

Incoming pixel intensity values are applied, over lead 608, to one input of maximum detector 810. In addition, the
output of each of the six line delays 820 is routed over a different one of six feedback leads to a respective input of the
detector. Consequently, maximum detector 810 extracts the maximum pixel intensity value from a 1-by-7 series of pixel
values and applies this maximum value as input to line delay element 820₀. In conjunction with six preceding corresponding
maximum pixel values stored in line delays 820, maximum detector 810 locates the maximum pixel intensity
value for the 7-by-7 window and applies that value, as output, to leads 637. Here, the use of series-connected delay
elements 820 and 860 substantially reduces the amount of logic that would otherwise be needed
to generate intermediate maximum and minimum pixel intensity values.
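
The same row-then-column decomposition can be sketched for the 7-by-7 maximum and minimum. As before, this is an illustrative software analogue rather than a model of detectors 810 and 850; the centered window, the skipped border positions and the function name are assumptions.

```python
def window_max_min(img, half=3):
    """Return {(r, c): (L max, L min)} over a (2*half+1)-square window.

    img is a list of equal-length rows of pixel intensities; positions closer
    than `half` to a border are skipped (an assumption made for this sketch).
    """
    rows, cols = len(img), len(img[0])
    centers = range(half, cols - half)
    # Stage 1: 1-by-7 maximum and minimum along each row.
    row_max = [[max(row[c - half:c + half + 1]) for c in centers] for row in img]
    row_min = [[min(row[c - half:c + half + 1]) for c in centers] for row in img]
    # Stage 2: combine seven vertically adjacent 1-by-7 results.
    out = {}
    for r in range(half, rows - half):
        for idx, c in enumerate(centers):
            l_max = max(row_max[r + d][idx] for d in range(-half, half + 1))
            l_min = min(row_min[r + d][idx] for d in range(-half, half + 1))
            out[(r, c)] = (l_max, l_min)
    return out


image = [[10 * r + c for c in range(7)] for r in range(7)]  # values 0..66
print(window_max_min(image)[(3, 3)])  # (66, 0)
```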

As one can now appreciate, the quality of the resulting thresholded image produced by our invention is controlled
by [...] thresholds GT and IT. Threshold GT is set to properly classify image pixels located in the vicinity [...]
is used in image regions of uniform tone, i.e., "flat" fields. In that regard, a low IT value tends to classify
[...] similar to "simple" thresholding with a fixed threshold IT. In this situation, the value of threshold GT
[...] always larger than the area gradient, thereby causing our invention to classify pixels by essentially solely
comparing [...] pixel of interest against fixed threshold IT. Our invention can also produce an outline image through
[...] their associated area gradients are larger than the gradient threshold, GT.

Although one preferred embodiment of the present invention has been shown and described in detail herein, many
other embodiments that incorporate the teachings of the invention may be easily constructed by those skilled in
the art.

INDUSTRIAL APPLICABILITY AND ADVANTAGES

The present invention is useful in image scanning systems and particularly for thresholding scanned gray-scale
images containing line art. The invention advantageously provides more accurate thresholding than that previously
obtainable in the art by exhibiting enhanced immunity to background noise in the scanned image and substantially
eliminating boundary artifacts from a thresholded image.

Claims 

1. A method for thresholding an input gray-scale image into a first output image, wherein said input image is formed
of input pixels each having a multi-bit gray-scale pixel value associated therewith, and wherein said first output image
is formed of pixels each having a corresponding output pixel value associated therewith, characterized in that the
method comprises the steps of:

ascertaining, in response to a plurality of pixel values in the input image, an area gradient value for a first
window [...], of pre-defined size, of intensity gradient values associated with ones of the pixel values centered
about a current pixel position, (i,j) (where i and j are pixel indices), in the input image;

determining, in response to a second window (330), of pre-defined size, of input pixels centered about the
current pixel position (i,j) in the input image, a maximum pixel value and a minimum pixel value of the pixel values
contained within said second window; and

thresholding, in response to the area gradient value and the maximum and minimum pixel values, a current one of
the pixel values for the current pixel position (i,j) in the input image into an output pixel value at a corresponding
position in the first output image.

2. The method of claim 1 wherein the area gradient ascertaining step comprises the steps of:

measuring, in response to a third window (370), of the pixel values, of a pre-defined size and centered about
the current pixel position (i,j) in the input image, an intensity gradient, G(i,j), value for the current pixel position such
that, for a succession of input pixels in the input image, a plurality of corresponding intensity gradient values is
defined; and

forming the first window of intensity gradient values from the plurality of intensity gradient values and, in
response to said first window, determining said area gradient value.

3. The method in claim 2 wherein said first window forming and area gradient determining step comprises the step of
summing individual ones of the intensity gradient values in said first window to produce the area gradient value.
4. The method in claim 3 wherein said thresholding step comprises the steps of:

comparing the area gradient value, GS(i,j), to a pre-defined gradient threshold value, GT, so as to produce
a comparison signal indicative of whether a current one input pixel located at the current pixel position (i,j) in the
input image lies in a vicinity of an edge in said input image;

if the comparison signal indicates that said current one input pixel does not lie in the vicinity of the edge,
thresholding, in response to said comparison signal, said input pixel value at position (i,j) in the input image against
a pre-defined fixed gray-scale threshold value, IT, to yield a corresponding binary output value at pixel position (i,j)
in the first output image; and

if the comparison signal indicates that said one current input pixel does lie within the vicinity of an edge,
setting, in response to said comparison signal, said corresponding binary output value to a first or second state if
the current one pixel value either exceeds or is less than an average of the maximum and minimum pixel values,
respectively.

5. The method in claims 3 or 4 wherein said intensity gradient measuring step comprises the step of producing the
intensity gradient value for the current pixel position by processing said third window of input pixel values through
a Sobel operator.

6. The method in claim 5 wherein the third window is a 3-by-3 window of input pixel values and the Sobel operator
implements the following equations:

GX(i,j) = L(i+1,j-1) + 2L(i+1,j) + L(i+1,j+1) - L(i-1,j-1) - 2L(i-1,j) - L(i-1,j+1);

GY(i,j) = L(i-1,j+1) + 2L(i,j+1) + L(i+1,j+1) - L(i-1,j-1) - 2L(i,j-1) - L(i+1,j-1);

and

G(i,j) = |GX(i,j)| + |GY(i,j)|

where: L(i,j) is an input pixel value, in luminance, for
the pixel position (i,j) in the input image.
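
The Sobel equations recited in this claim translate directly into code. The sketch below assumes the image is stored as a list of rows indexed as L[i][j] and that (i,j) is not on the image border; the function name is hypothetical.

```python
def sobel_gradient(L, i, j):
    """Return G(i,j) = |GX(i,j)| + |GY(i,j)| for an interior pixel of image L."""
    gx = (L[i + 1][j - 1] + 2 * L[i + 1][j] + L[i + 1][j + 1]
          - L[i - 1][j - 1] - 2 * L[i - 1][j] - L[i - 1][j + 1])
    gy = (L[i - 1][j + 1] + 2 * L[i][j + 1] + L[i + 1][j + 1]
          - L[i - 1][j - 1] - 2 * L[i][j - 1] - L[i + 1][j - 1])
    return abs(gx) + abs(gy)


ramp = [[0, 0, 0],
        [5, 5, 5],
        [10, 10, 10]]               # luminance increases with the first index only
print(sobel_gradient(ramp, 1, 1))   # 40: GX = 40, GY = 0
```

Using the sum of absolute values rather than a square root keeps the computation to additions and sign changes.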

7. The method in claim 4 further comprising the steps of:

producing, in response to said comparison signal and the pixel value (Lc) at the current pixel position (i,j), a
modified gray-scale output pixel value at position (i,j) in a second output image, wherein, if the comparison signal
indicates that the one current input pixel does not lie in the vicinity of the edge in said input image, the modified
gray-scale output pixel value is set equal to the pixel value Lc, or if the comparison signal indicates that the current
one input pixel does lie in the vicinity of the edge, the modified gray-scale output pixel value is set equal to a
pre-defined value greater than or less than the threshold value IT when the pixel value Lc is greater or less than,
respectively, the threshold value IT; and

thresholding the modified gray-scale output pixel value, against the threshold value IT, to yield a corresponding
second binary output pixel value at a pixel position (i,j) within a second output image.

8. Apparatus for thresholding an input gray-scale image into a first output image, wherein said input image is formed
of input pixels each having a multi-bit gray-scale pixel value associated therewith, and wherein said first output image
is formed of pixels each having a corresponding output pixel value associated therewith, characterized in that the
apparatus utilizes the steps recited in any of claims 1-7.






FIG. 1 [block diagram; surviving labels: document; scanner; gray-scale image data; digital image processor
(thresholding and gray-scale modification); binary image (33); modified gray-scale image (37); simple thresholding
circuit; binary image]



FIG. 2 [diagram showing the alignment of the drawing sheets for FIG. 2A and FIG. 2B]

FIG. 2A [flowchart; surviving block labels: START; gray-scale image data, L; calculate gradient strength for
pixel (i,j); calculate the sum of gradient strength (GS) in (N-2) x (N-2) window centered on pixel; detect L max and
L min in N x N window centered about pixel (i,j); calculate L avg = (L max + L min)/2; set intensity of output pixel
to intensity of pixel in center of window, Lc; set intensity of output pixel (i,j) to black; set intensity of output
pixel (i,j) to white; END]

FIG. 2B [flowchart; surviving block labels: Lc > IT?; set intensity of output pixel (i,j) to Lc; set intensity of
output pixel (i,j) to IT+; set intensity of output pixel (i,j) to IT-; set intensity of output pixel (i,j) to black; END]



FIG. 3A, FIG. 3B and FIG. 3C [pixel-window diagrams; surviving labels: neighboring pixel positions (i-1) and (i+1)
about the center pixel; an N-by-N window; an (N-2)-by-(N-2) window; reference numerals 310, 330, 350 and 370]






FIG. 7 [block diagram; surviving label: gradient input]






(19) Europäisches Patentamt
European Patent Office
Office européen des brevets

(11) EP 0 712 094 A3

(12) EUROPEAN PATENT APPLICATION

(51) Int. Cl.6: G06T 5/00



(88) Date of publication A3: 

10.07.1996 Bulletin 1996/28 

(43) Date of publication A2: 

15.05.1996 Bulletin 1996/20 

(21) Application number: 95202928.8 

(22) Date of filing: 30.10.1995 

(84) Designated Contracting States: 
DE FR GB IT 

(30) Priority: 10.11.1994 US 338051 

(71) Applicant: EASTMAN KODAK COMPANY 
Rochester, New York 14650-2201 (US) 

(72) Inventors: 

• Lee, Yongchun,
c/o Eastman Kodak Co.
Rochester, New York 14650-2201 (US)
• Basile, Joseph M.,
c/o Eastman Kodak Co.
Rochester, New York 14650-2201 (US)
• Rudak, Peter,
c/o Eastman Kodak Co.
Rochester, New York 14650-2201 (US)

(74) Representative: Blickle, K. Werner, Dipl.-Ing. et al
KODAK AKTIENGESELLSCHAFT 
Patentabteilung 
70323 Stuttgart (DE) 
(57) A technique, specifically apparatus and an
accompanying method, for accurately thresholding an
image based on local image properties, specifically
luminance variations, and particularly such a technique that
uses multi-windowing for providing enhanced immunity
to image noise and lessened boundary artifacts.
Specifically, a localized intensity gradient, G(i,j), is determined
for a pre-defined window (300) centered about each
image pixel(i,j). Localized minimum and maximum pixel
intensity measures, L min and L max, respectively, are also
determined for another, though larger, window (330)
centered about pixel(i,j). Also, a localized area gradient
measure, GS(i,j), is determined as a sum of individual
intensity gradients for a matrix of pixel positions (370)
centered about pixel position (i,j). Each image pixel(i,j) is
then classified as being an object pixel, i.e., black, or a
background pixel, i.e., white, based upon its area
gradient, GS(i,j), and associated L min and L max measures.